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The present invention relates generally to telecommunications, 
Existing telephony systems suffer from a number of problems including 
on fixed hardware which results in long time to bring new products to 
personal computers, without the network performing any processing of 
The invention provides for an intelligent server which executes a separatle 
available to that user. This allows for an open and flexible teleconference 



more specifically, to a system and method of teleconferencing, 
system complexity, limited access and implementation of services 
Internet applications implement teleconferencing on end user 
lata streams or guaranteeing quality of service in the transmission, 
mixer for each user, where the mixer is dedicated to the resources 
system which can operate over multiple networks. 
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Method and System of Teleconferencing 

Field of Invention 

The present invention relates generally to telecommunications, and more 
5 specifically, to a system and method of teleconferencing. 



applications 



Background of the Invention 

Teleconferencing systems allow 
they were in the same room. In spite of the 

1 0 these systems, they are commonly used for 
resulting reductions in travel time and cost, 
not generally be rationalized in other 
so teleconferencing is not common in these 
Traditional teleconferencing systems 

1 5 monophonic speaker arrangements at each 
teleconference, and the methodology was tc 
participants, blocking the remainder of the 
evolving and systems are now available 
signals of the participants and stereo sound 

20 present even greater demands on the carrie 
and lower latency, which results in even 
explains the limited availability and use of 
Generally, each teleconferencing 
specific communication network. Presently, 

25 dominant: the public switched telephone 
These systems are typically composed of 
personal computers, an access network sucfi 
link, and a backbone network such as the 
or the intercity data networks. Although the 

30 greatly, the backbone networks must handle 
operate reliably and efficiently. Therefore, 
focussed on the provision of single services 
incentive for telephone companies to offer 
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peo Die at different locations to converse as if 
currently high cost and complexity of 
business applications, because of the 
However, the cost and complexity can 

such as academia and private use, 
lareas. 

consisted of single microphone and 
physical location participating In the 
broadcast the loudest voice to all other 
voices. However, the art has been 
offer such added features as video 
Generally though, these new features 
networks in terms of higher bandwidth 
cost and complexity. This largely 
advanced teleconferencing systems, 
is designed to be used with a 
two communication networks are 

for voice, and the Internet for data, 
equipment such as telephones or 
as a telephony local loop or a radio 
switched telephone network (PSTN) 
leeds of users at the terminals vary 
highly standardized loads in order to 
traditional communication networks 

;ather than differentiation. There is no 
varied features or to serve small niche . 
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markets as the revenues would not offset the substantial cost of developing and 
implementing these additional products. 

In voice telephony, services are impler nented by having large computer 
programs running on centralized switches whi :h interrogate local and distributed 
5 databases. The local databases specify which features are enabled on a given line, 
the switch software interprets these feature lists and implements the switch 
behaviour, and the switch software also interrogates the distributed databases via 
Common Channel Signalling^System No. 7 (S37) queries. SS7 is a global standard 
for telecommunications that defines the procedures and protocol by which network 

1 0 elements in the public switched telephone net\ vork (PSTN) intercommunicate control 
messages for basic call setup, management, £ind tear down, as well as for special 
intelligent or database services such as local number portability (LNP), toll-free 
(800/888) services and call forwarding. - 

In PSTN, a user only has access to services provided by the local exchange 

1 5 carrier, which in turn may only function within t hie bounds of the SS7 protocol. 

Therefore, users can only access the switches in a limited way, and new features can 
not be added by outside parties. 

Telephony features, such as teleconferencing, may only be implemented by 
adding code to the programs running the switc nes or by adding specialized hardware 

20 to the telephony network. The features available to particular users are defined in the 
local databases accessed by the switch softWc re, and adding a new type of feature 
may involve changing these databases together with the switch software that uses 
them, and may also involve purchasing and installing new types of hardware in the 
network. 

25 This limits the speed with which new features can be introduced since new 

hardware and software m ust be designed, test 3d, manufactured and deployed. The 
inflexible assignment of tasks also makes it imoossible to share loads between 
different types of hardware, for example to use idle tone-decoding hardware to help 
with an overload of voice-conferencing or to provision a new teleconferencing 

30 feature. 

A traditional PSTN teleconferencing system provides each user with a 
bidirectional audio communication link with each of a plurality of remote transceivers. 
Typically, the system includes a microphone ai each location for producing an audio ( 
signal from that location and a transport network such as the public switched 
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telephone network (PSTN) which deiiverien each voice signal to a conference bridge 
This conference bridge mixes the voice signals and returns them to audio amplifiers 
and speakers at each location. 

The conference bridge Is implemented as a new hardware component 
connected to the switch providing the servise. Adding a new feature such as Dolby 
noise reduction or bass boosting requires £ physical change to the hardware and/or' 
software in every switch that offers the service. 

Changes to existing telecommunication networks are therefore very 
complicated to make. There is a rigid modsl and hardware structure is difficult to 
extend, so existing telephone companies a 1 

When they do develop new products they Inevitably take a long time to bring to 
market and are expensive to implement. 

Telecommunications systems need [to process the data flowing through in 
complex ways, often with processing occuning on computer systems separated both 
geographically and administratively. Many communications paths are simultaneously 
active, and the processing applied to the vs rious flows of data changes frequently 
and in a wide variety of ways. The software needed to control these computer 
systems is generally large, complex and dif IcuK to change. 

The complexity of present telecomrr unications systems software, and the 
extensive interactions between its software 

new features very difficult. As well, telecorr munication services have traditionally 
been provided by large monopolies who err ployed proprietary equipment that only 
they had access to. Large telephone comp anies hesitate to allow open access to the 
control of their switches and servers due to the risk of failures and the resulting 
damages that would occur; therefore, only v ery limited access is allowed 



companiesis therefore limited to a 
reduces the talent pool available and 



Software development for telephone 
"closed" group of trusted developers, which 
shuts out developers with new ideas for niche markets. 

In summary, problems with the PSTN include: 

1 . system complexity results in long tirr e to bring new products to market; 

2. cost of services results in focus on fcsw specific services rather than diversity 
and niche markets; 
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3. existing services are provided by dec icated hardware and software which are 
inflexible and must be physically, anc often manually, modified to offer new 
services or features; and 

4. only proprietary access to switches and their software code is allowed. 
The implementation of software applications in an Internet environment is 

generally done by the software running at thi * endpoints, and the IP (Internet 
Protocol) network is treated merely as a con Jurt for transfer of data packets between 
the two points. The routers in the IP network merely Index internal routing tables 
using the address of data packets so that thoy know how to forward them, and do not 
generate data for either of the endpoints, or i eact to instructions from either of the 
endpoints. The Internet itself may be envisic ned as a series of routers 
interconnected by an Internet backbone network designed for high-speed transport of 
large amounts of data. Users may access the internet using personal computers in a 
number of manners including modems connected to the Public Switched Telephone 
Network (PSTN), or set top boxes connecteq to existing telephone or television cable 
networks. 

Communications over the Internet cah be administered using various 
protocols, over a variety of physical transfer media. A protocol is a set of conventions 
or rules that govern transfer of data between I 

protocols define only a hardware configuration while more complex protocols define 
data formats, error detection and correction techniques and software structures. 

The key advantages of a protocol like 
function efficiently and that it offers a standardized means by which applications 
software can use that network. The main disadvantages are: 
1 . that it does not allow processing to be | 



performed on data streams; and 
2. that it does not allow quality of service to be specified. 

For example, the Internet generally will not let a user run an applet on a node 
or server. This limitation is due to the architecture of the Internet which is based on 
the international OSI (Open Systems Interconnection) standard. The OSI standard 
30 describes communication systems using a seven layer model, each layer being 

operable to perform certain functions. Although OSI is not always strictly adhered to 
in terms of keeping related functions together in a well-defined layer, most 
telecommunication products make an attemp : to place themselves in relation to the 
OSI model. The OSI standard is not likely to| change dramatically, nor is the 
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it adds to backbone networks and the 
;ed. As well, little is accomplished 
on are responsive to the protocol, which 



Internet's use of the standard, so the Internet will not likely become an active 
component in the provision of telecommunication services. 

More importantly, the Internet does n at allow quality of service to be specified. 
Internet communications generally rely on th 3 transport of data packets over various 
heterogenous networks, so even though certain links may have predictable data 
rates, for example, a privately owned T1 line total end to end transfer rate is still not " 
predictable or dependable. 

Some protocols such as resource reservation protocol (RSVP) set tags and 
priorities which can influence the routers on fin Internet path a little, but not a great 
deal. The RSVP is an extension to IP that pe irmfts specification of quality of service 
at a technical level, in terms of parameters such as data rates and latencies. It has 
had limited acceptance due to the complexity 
need for their switching hardware to be upda 
unless all switches in the end to end connect 
is not generally the case. 

Therefore, typical software applications operating over the Internet, such as 
teleconferencing, look at the Internet as simp y a transport network without any 
processing capability and all functionality is p aced at the participant's locations. 
Implementations of teleconferencing over Internet, for example, have software at 
each user's personal computer (PC) that acts as the interface with the user, 
converting voice to data packets for IP transmission to each of the other participants 
in the teleconference. Accordingly, the user's PC also receives streams of voice data 
from each of the other participants in the tele< inference and plays them through a 
sound card. 

This implementation suffers from seve re scalability problems. For example, if 
there are ten participants in a teleconference, then each participant would require 
sufficient bandwidth to download nine simultaieous voice data streams from the 
other participants, in real time. As the bandwdth to each user would increase linearly 
with the number of participants, and the load - >n the network increase with the square 
of the number of participants, there would be an immense load on the network 
resources. Clearly, this is impractical for teleconferences with a large number of 
parties or sen/ices which themselves require high bandwidth such as video or high 
quality voice. Even if the bandwidth could be obtained, there is no way to ensure that 
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it is consistently available, as there is no 
Internet applications. 

As noted above, typically, each existi 
to operate over a particular network and is 
varied networks now available. These 
network (PSTN), Internet, cellular telephone 
area networks (LANs) and wide area 
are a variety of media including optical fibre, 
connections, which execute communications 
digital format using a variety of different 
been widely implemented, at considerable 
be quickly abandoned and a new, standard, 
constructed. Therefore, there is a need for s 
implementing teleconferencing over a mixed 
networks. 

Asynchronous Transfer Mode (ATM) 
protocols for addressing packets of data and 
typically been deployed in the core of 
speeds at which ATM equipment operates 
accessible and because of the complexity of 
these mechanisms have not been used by 

Besides the IP and ATM networks 
networks such as Frame Relay and Ethernet 
carry data, for example using trellis coding 
signal commonly; which is commonly used 
are also evolving of each major type of 
between implementations of these networks 
complexity induced by this variety makes it 
to exploit all the networks available, and to 

Feature development is already dr 
teleconferencing over voice networks. As 
messaging, shared files and whiteboards are 
products, and new applications such as 
Internet gaming, develop, the problem is 



way to specify quality of service (QoS) in 



not< 



new 



ng teleconferencing system is designed 
capable of cooperating with the many 
networks include public switched telephone 
systems, satellite communications, local ' 
networks (WANs). Within these networks there 
wireless or hardwired electrical 
over these networks in analogue or 
protocols. Many of these networks have 
capital cost, so it is unlikely that they will 
world-wide telecommunications network 
system which is capable of 
combination of communications 

networks, for example, use standard 
setting up connections, and have 
backbdne networks because of the high 
Eiecause ATM routers are not directly 
their mechanisms for describing QoS, 
applications software, 
mentioned above, there are other data 
As well, the PSTN may also be used to 
maps digital data onto an analogue 
Personal Computer modems. Variants 
netwcfrk, and engineering differences 
tfesult in different performance. The 
difficult for users and application software 
any to its fullest extent, 
difficult for the simple application of 

media such as videophone, typed 
mixed with traditional teleconferencing 
distance learning, Internet Relay Chat and* 
becoming even more severe. This probleni 
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will grow even greater as expectations develop for features from one domain to be 



mapped into another, as when customers 



sxpect a feature similar to call-waiting to 



apply in videoconferencing or Internet ganr ing. 



Furthermore, even for a single app 



needs, for example, requiring different degrees or forms of encryption; Therefore, 
there is a need for a system which can allow many cases and features without 
becoming complex, slow to develop and sbw in operation. 

There is therefore a need for a method and system of teleconferencing that 
may be implemented over mixed telecommunications networks, and addresses the 
complexity of such existing networks to provide an open, scalable and flexible 
architecture. 
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> input 



Summary of the Invention 

It is therefore an object of the invention 
teleconferencing which addresses the problems 

One aspect of the invention is broadly 
teleconferencing comprising: three or moro 
input and an audio output; a telecommunications 
terminals and operable to transport data tc 
modular mixing software for each respective 
telecommunications network, and operablo 
the audio outputs of the others of the user 
audio signals into a signal for the audio 
correlates to the needs of the respective 

Another aspect of the invention is 
comprising: means for interconnecting us$r 
from the user terminals; means for executing 
each respective user terminal, the separat 
means for receiving separate audio signal^ 
user terminals; and means for combining 
the audio input of the respective user terminal 
respective user terminal. 

An additional aspect of the invention 
teleconferencing comprising the steps of: 



to provide a method and system of 
described above, at least in part, 
defined as a system for 
user terminals, each having an audio 

network interconnecting the user 
and from the user terminals; separate 

user terminal, executing on the 
: to receive separate audio signals from 
terminals; and to combine the separate 
of the respective user terminal which 
terminal. 

Refined as: a server for teleconferencing 
terminals and transporting data to and 
separate modular mixing software for 
2 modular mixing software including: 
from the audio outputs of the others of the 
separate audio signals into a signal for 
which correlates to the needs of the 



userl 



tie 



is defined as: a method of 
ifeceiving, at a separate modular mixer 
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user terminals and executing on a 



representing a respective one of three or more 
telecommunications network, separate audio signals from audio outputs of the others 
of the user terminals; and combining the separate audio signals into a signal for an 
audio input of the respective user terminal which correlates to the needs of the 
respective user terminal. 

A further aspect of the invention is defirfied as: a computer data signal 
embodied in a carrier wave, said computer data signal comprising a set of machine 
executable code being executable by a computer to perform the steps of: receiving, 
at a separate modular mixer representing a respective one of three or more user 
terminals and executing on a telecommunications network, separate audio signals 
from audio outputs of the others of the user teiminals; and combining the separate 
audio signals into a signal for an audio input of| the respective user terminal which 
correlates to the needs of the respective user 1 

A still further aspect of the invention is < 
storage medium storing a set of machine executable code, the set of machine 
executable code being executable by a computer server to perform the steps of: 
receiving, at a separate modular mixer representing a respective one of three or 
more user terminals and executing on a telecommunications network, separate audio 
signals from audio outputs of the others of the i 
separate audio signals into a signal for an aud o input of the respective user terminal 
which correlates to the needs of the respective user terminal. 



erminal. 

defined as: a computer readable 



will become more apparent from the 
to the appended drawings in which: 
system in a broad manner 



Brief Description of the Drawings 

These and other features of the invention 
following description in which reference is macje 
Figure 1 presents a physical layout of a teleconferencing 

of the invention; 

Figure 2 presents an exemplary physical layojjt of a teleconference system in a 

preferred embodiment of the invention; 
Figure 3 presents a block diagram of exemplary 

preferred embodiment of the invention 
Figure 4 presents a block diagram of an exemplary 

a preferred embodiment of the invention. 



signal processing software in a 
and 

operating system architecture in 
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I output 



Detailed Description of Preferred Embod 

A system which addresses the objects 
physical layout in Figure 1. This figure 
between three or more user terminals 12, 
5 audio output. The phrase "terminal" is used 
suitable manner of user audio input and 
telephones, and personal computers with 
The audio input and output refer to the 
telecommunications network, and not to the 

10 terminal. 

A telecommunications network 18 Int 
16 and has the necessary functionality to 
telecommunications network 18 also executes 
each respective user terminal 12, 14 , 16. 

1 5 operable to receive separate audio signals 
to combine those separate audio signals intc 
transported to the audio input of the respecti 
correlates with the needs of that respective 
That is, if there were three participan 

20 three mixers, a first mixer for participant A, 
participants B and C, a second mixer for 
third mixer for participant C, which mixes A 
The use of individual mixers 20, 22, 
network 18 addresses a number of the 

25 Firstly, having individual mixers 20, 

to its own user and to be tailored to the 
resources of the network and network 
example, the mixer of a user having a high 
quality stereo to its user, with balanced mixing 

30 user connection via an analogue, monophon 
strongest voice to its user, blocking voice 
reducing noise. 

Having a single mixer for all participants 
require an immense piece of software code i 
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ments of the Invention 
outlined above, is presented as a 
presjents a system 10 for teleconferencing 
, 16, each having an audio input and an 
generally in the art to describe any 
device including telephones, cellular ~ 
microphones and speakers or headsets, 
connections between the terminal and the 
audio interface between the user and the 

rconnects these user terminals 12, 14 , 
transport data packets between them. The 
separate modular mixing software for 
separate mixers 20, 22, 24 are 
each of the other user terminals and 
one signal. This one mixed signal is 
user terminal, in a manner that 
liser terminal. 

s in the teleconference, there would be 
mixes the audio output signals of 
participant B, which mixes A and C, and a 
B. 

executing on the telecommunications 
problems noted above. 

:, 24 allows each mixer to be dedicated 
limitations of the user's resources and the 
connexion that services that user. For 

bandwidth connection may provide digital 
of all participant's voices. Another 
c, PSTN connection may send only the 
signals from other participants and thereby 

, as taught in the PSTN art, would . 
all of the variations in user 



Tiese: 



from i 
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over existing Internet based 
methods broadcast all voice streams to ~ 



requirements were to be handled in a single 
software code would be unmanageably largs, complex and slow, existing systems 
have simply not offered such diverse services. 

Similarly, the use of mixers 20, 22, 24 executing on the telecommunications 
network 18 offers a substantial improvemen 
teleconferencing as well. Typically, Internet 
all participants, so each terminal receives up to (N - 1) streams where there are N 
participants. This places a tremendous demand on the bandwidth of the final 
connection to each user and a tremendous pad on the network, in contrast, the 
invention requires only the number of audio 
audio output, to be sent to the user. That is 
only one channel is required, and if stereo is 



sound, surround sound, central bass and other audio arrangements would require 



corresponding numbers of audio channels, 
required to each user and the loading on the 
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piece of software. As this piece of 



channels that the user requires at his 
if the user desires monophonic output, 
desired, two channels. Quadraphonic 



This greatly reduces the bandwidth 
network. 



to the provision of a system that is 
become more apparent from the 



Thirdly, it is also significant that the mixers of the invention are implemented in 
a modular manner. As will be described in £ reater detail hereinafter, all of the 
software components of the invention are implemented in small modules. Having 
small modules designed to handle very specific tasks results in a far simpler system 
than those like the existing PSTN. The moro defined the task that a module 
addresses, the easier it is to design that module and later, to integrate it into the rest 
of the software system. This is fundamental 
flexible and open. 

Other advantages of the invention wi 
description of the preferred embodiment whi^h will be presented in terms of an 
example 

Figure 2 presents an exemplary physical layout of a teleconference in the 
preferred embodiment of the invention, havirjg four participants. Two participants 
have direct access to an active network, whi 
term "active network" refers to a network tha 
software and other related software components described hereinafter. 

Participant A has a personal computer (PC) 26 connected to a first active 
network 28 via a wireless connection 30. The entity on the active network 28 which 
serves Participant A is called a NetPort 32. Jhe specific role of the NetPort 32 within 



e two are connected to the PSTN. The 
is operable to execute the mixer 
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the active network 28 will be described in greater detail hereinafter. The PC 26 is 
running a stereo-enabled Web browser with a RealAudio plugin that implements 
streaming audio and is output at speaker 34. The PC 26 also has a simple 
microphone plugin that passes samples fro ti the on-board microphone 36 back to an 
5 IP address. Participant A also has a WebCam 38 connected to her PC 26. 

Participant B is connected directly to a second active network 40, but In a 
location geographically remote from the first active network 28. Participant B also 
listens from a speaker 42 connected to his PC 44 through a streaming-audio 
application, but is talking through a wireless telephone 46 for its boom microphone. 

1 0 Participant B is plugged into a second NetPort 48 through a telephone jack and 
hardwired connection 60. 

Participant C is connected to the PCTS 52 (plain old telephone system) via a 
plain black rotary-dial telephone 64. 

Participant D has two speaker phonos 54, 56 fed by two separate POTS lines. 

15 He also has an Internet connection via a PC 60 which runs a Web browser, but his 
Internet Service Provider (ISP) does not provide good enough quality of service 
(QoS) for voice, so he just uses it for the gniphic user interface (GUI). 

A GUI is piece of software that presents data to users in a graphical manner, 
allowing for easy interpretation and modification. It is preferred that the invention be 

20 implemented in such a manner, where poss ble. The GUI runs as Java in the PC 
browser, and communicates with call processing applications running on the active 
network by means of sockets. Invoking it involves typing a URL (uniform resource 
locator such as "coolPhones.conrO, after which it sits in a window waiting for an 
incoming call or a user input event to place a call. Inputs can be made via a mouse, 

25 keyboard, trackball, touchscreen, joystick or other similar manner. The GUI is strictly 
an interface, though, since it is unacceptable for example, to have voice-mail fail 
when the PC is not active. Therefore, the real call processing decisions are made on 
the active network side. 

This exemplary system also includes an Internet network 62, which is 

30 connected to the PSTN using, for example, H.323 and SIP (Session Initiation 

Protocol) connections. These connections are known in the art, as are others. The 
Internet network 62 is also shown to be connected to both active networks 28, 40, but 
many other system topographies are also po ssible. The invention is not limited by 
any particular topography. 
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output 



currently speaking. 



due 



: full 



While the teleconference is in progress 
the left, Participant C in the middle and Part 
A is mixing the other participant's monophorjiic 
stereo spectrum. The use of the stereo 
5 aids in identifying which participant is 
noise levels to be tolerable to participants 
effect acknowledges that people are able to 
an environment where there is considerable 
have a means for identifying and focussing 

1 0 The use of stereo sound has been shown to 
Stereo sound can be synthesized in 
implementation, amplifier gain can be varied 
example: one participant may be played at 
all on the right, a second may be played with 

1 5 none on the left, while a participant may be 
More complex implementation of stereo may 
audio signal before playing it on one of the 
the sound takes to travel to the farther of the 
environment Such methods are generally 

20 Accordingly, Participant A's GUI scre&n 

B's picture on the left, Participant C's numbejr 
ID name on the right. In each case, the GUI 
information that it has available for each 
URL or telephone number. As well, "screen 

25 which identifies personal information about tfjie 
As well, via the GUI, Participant A can 
of the three participants on her screen to ens ble 
to see her through the WebCam. Participant 
the volume level at which they speak to her, 

30 adjust their stereo imaging. Alternatively, the 
governed by their physical location on the 
is away from an icon representing the user, 
preferred feature of the GUI is that when 
brighter then gradually fades again with inactivity 



■ GUI 



tie 



PCT/CA99/00875 



, Participant A hears Participant B on 
cipant D on the right, because Participant 
voice streams into a metaphorical 
has two major advantages. Firstly, it 
ing. Secondly, it allows higher 
to the "cocktail party" effect. This 
converse comfortably with one another in 
background noise, provided that they 
tjieir attention on a particular speaker, 
provide this identification, 
i number of manners. In a simple 
between the left and right channels, for 

volume on the left channel and none at 
full volume on the right channel and 
ilayed at equal volume in both channels. 
, for example, introduce a delay to the 
channels, simulating the additional time 

listener's two ears in a regular physical 
khown to those skilled in the art. 

shows a Web page with Participant 
in the middle and Participant D's caller- 
displays the best identification 

This identification may include a 
pop" information could be provided 
participant such as his address, 
click on "ear" and "eye" icons for each 
or disable their ability to hear her or 
A can also drag on a "mouth" icon to set 
and drag participants left and right to 
volume level of participants could be 
display - the further a participant's icon 
lower their volume level. Another 
speaks, their icon becomes « 



part cipant. 
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Similarly, Participant B hears ParticiDants A, D and C in a metaphorical stereo 
spectrum from left to right. Because he finds the sound of participant A's voice close 
to that of Participant C, he has chosen to separate them spatially as much as 
possible. This is done with the same type of GUI described with respect to 
Participant A above. 

Participant C hears a conventional four-way conference call, with the voices ~ 
of the three other Participants companded cind mixed together. As a result, she has 
difficulty distinguishing Participants B and D. However, she has the flexibility to tailor 
the call to some extent with the preference Tor single voice dominating, adding noise 
filters, or other functionality via her proxy. This addition of functionality directly to 
PSTN customers is very significant. As explained in the background, PSTN services 
are driven by a supply model that only provides commodity services, and takes a 
long time to provide those limited services. There is a vast PSTN infrastructure which 
provides single monophonic lines into millions of homes and businesses, all of which 
are shackled with these limitations. The use of proxies In the manner of the invention 
provides greater flexibility and access to new services which may be implemented 
quickly and at very low cost. More details a:* provided hereinafter regarding the 
preferred use of proxies. 

Participants who do not have the capability of interacting with the active 
network will have generic proxies assigned to them which are dictated by the nature 
of their telecommunication connections. For example, the system knows that a 
Participant only has PSTN access if that is tie connection the call manager has 
identified as the best connection during call setup. 

Participant D has a similar stereo arrangement, over which he has defined 
Participant A to reside on the left speaker te ephone, Participant D on the right 
speaker telephone and Participant C on both channels. This arrangement also 
creates a metaphorical stereo spectrum. Ot ler means are known in the art for 
carrying stereo over the PSTN, but such me hods generally require more complicated 
hardware at the Participant's end. Participant D has the same GUI as Participants A 
and B so he is able to control his proxy and mixer on the active network directly. 

As an example, exemplary signal processing software for participant A is 
presented as a Mock diagram in Figure 3. Voice streams from other participants 
arrive in different forms and need to be converted to a consistent form, companded 
and then mixed. They also need to be trans nitted across the radio link in an efficient 
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form, and other signal processing such as echo 
to be done on the voice data. In this case, RealAudio 
consistent form though MP3 or a number of 
particularly convenient as it is a realtime streaming 
5 industry and for which many tools are currently 
This type of processing is also required for the other 
different forms for each participant which come 

Not shown in Figure 3 are such functions 
level control, though their implementation follows 

10 provided herein. 

Specifically, pulse code modulated 
Participants C and D, which are connected to 
is the standard transmission form for audio in 
received from Participant B is already in RealAudio 

1 5 packets and is easily transported over IP, it is 
passing its signal to the RealAudio mixer 68. 

The RealAudio mixer 68 combines the i 
with the participant's requirements, In Figure 
line is shown which interfaces the RealAudio mpxer 

20 card 72. The audio output of the RealAudio mifcer 
well, and also to a PCM converter 74. 

This PCM convertor 74 feeds the echo 
more or less matches the output from the participant 1 
echo cancellor 76 can remove the speaker output 

25 up by the microphone 36. The PCM signal 
converted to RealAudio at the voice coder 78 

RealAudio packets are numbered sequentially 
arranged in the proper order when they are deqoded. 
to time stamp packets as the time delays are 

30 packets that result from their transport from different 
will not generally be detectable by the participants 
deliberately add a delay to the incoming signals 
signal jitter. As the data is arriving in finite and 
inevitably be some degree of jitter, so buffering 



cancellation/suppression may need 
lio has been chosen as the 
forms cpuld be used. RealAudio is 
standard that is well known in the 
available, such as codecs and mixers. 

listeners, and it takes slightly 
ate to their respective setups, 
as encryption, tone controls and 
logically from the description 



voice streams are received from 
RealAudio converters 64 and 66. PCM 
PSTN. Since the voice signal 

format, which comprises data 
necessary to convert it before 
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ijicoming audio signals in accordance 
, a bi-directional Activity and Control 
68 with the PC 26 via an Ethernet 
68 goes to the Ethernet card 72 as 

Cancellor 76 with an audio signal that 
's speaker 34. This way, the 
signal that is inadvertently picked 
the echo cancellor 76 is 



leaving 



to ensure that they are 
L Generally, it is not necessary 
; and the varied delays in data 
sources, or by different routes, 
In fact, RealAudio may 
by storing them in a buffer to absorb 
distinct data packets, there will 
is preferred. A buffer that causes 20 
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mSec - 50 mSec delay is sufficient time to absorb the effects of jitter most of the time, 
and is not long enough to annoy the users to a great extent. 

Also, note that all audio signals passing between the NetPort and NetPort 
Manager are in RealAudio form, and not PCjM. This makes the transport over the 
digital interconnection more convenient. 

Other preferred aspects of the teleconference are outlined as follows: 
1. Call Setup 

The most important aspect of call setup is the Identification of the participants, 
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where they can be found and then o 
preferred embodiment, the teleconfe 



course, creating the connections. In the 
5 'ence will be created by one or more of 
the participants who are GUI-enabled. These participants will advise the 
network of the identities of the participants and the call setup software on the 
network will make the connection wit i the participants. 
Some of the participants will have internet addresses, while others will have 
telephone numbers. In each case, ttte call setup software will investigate the 
participant and establish the best possible connection that it is aware of. 
Those participants without Internet atscess will be assigned proxies which 
reflect the resources they have access to. For example, if the call setup 
software identifies a PSTN telephone number as the best connection, it will 
assign a PSTN proxy to that participc nt unless advised otherwise. 
In the preferred embodiment, all part cipants who are GUI-enabled can add 
participants to the call, but for high-security teleconferences, addition of 



participants should be controlled by z 



the functionality in and implied by Fig 



This contrasts with the "pure Internet 
cooperating tasks in all of the various 
of the processing through an applicat 



calls, with no single program having m overview of the whole setup. This 



makes it very difficult to optimize and 



single participant 



Telecommunications Operating System 

The telecommunications operating system aspect of the invention provides 
unified control and access to all syste m resources and networking links, with 



ure 3. This represents a large collection 



of signal processing and control functions connected together in response to 
the commands of the callers. 



Protocol" approach which require 
computers to arrange to do their parts 
bn-specific protocol built on socket() 



manage the system, and each such 
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application has to reinvent call process 
part of the underlying implementation 
invention is built as a middleware layer 
In "telephony classic" one would not 
5 generality in software, but would make 

hardware that assumes all inputs and 
numbers to call in order to connect to it 
makes this a "closed system", in which 
limitations on who can develop new 

10 3. Proxy 

It is preferred that the invention be 
graph model as described in the co-| 
Patent Cooperation Treaty, Serial No 
for Telecommunications". While a si 

1 5 and manage the entire connection, 

well as voice streams, that application 
"agents" from "proxies". 
A proxy is a piece of software that acts 
connection. In this case Participants A 

20 and Internet providers are separate 

contains data that represents the 
whether Participant A is already on the 
network's 28 voice trunk is getting full, 
do specific tasks, such as responding tc 

25 managing a voice call in progress 

The terms proxy and agent are 
For the purposes of this document, they 
agents, each of which handles a specia 
not comprise an immense block of code 

30 in its simplest form, is merely a 

as required, discarding them when their 
These agents are sent to parts of the 
going on and are connected to the signal 
through* a "controlling application". This 



ng. The invention uses socket() as 
a preferred embodiment, so that the 
on top of IP. 
attempt to set up something with this 
special "stereo conferencing server" 
cjutputs are PCM and would add speciaT 
The need for specialized hardware 
nnovation is slowed down by 
telecommunications applications 

applied to a network which employs a 
pending patent application under the 
_, titled "Connection Manager 
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single application program is used to see 
including IP and videoCam connections as 
\$ constructed by collecting together 

on behalf of a specific party to a 
through D, and each of the networks 
parpes with separate proxies. A proxy 
and state of the party, such as 
elephone and whether the first active 
has components that are agents to 
off-hook on a telephone and 



preferences ; 



smd i 



used interchangeably in the art. 
are distinct: a proxy is built out of 
situation. Therefore, the proxy does 
with all conceivable functionality, but 
or which instantiates software agents 
tasks have been completed, 
system in which signal processing is 
processing code or hardware 
architecture of proxies, agents and 
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controlling applications is what allows connection management applications to 
appreciate the whole structure of a connection while still being "owned" by 
several different parties. 

Proxies should persist in the preser ce qf component failures, so that, for 
example, a user's forwarding instrui rtions do not get lost during a crash. It is 
preferred that persistence be provic ed via a distributed database which is 
continuously updated, so that all concerned parties are aware of the status of 
the communication. In the event of a failure, the system is able to work 
around the failure, allowing the conr munication to continue. Such 
transactional interaction techniques are known in the art. 
In the "pure Internet Protocol" approach there is only custom software running 
on hardware belonging to the various parties involved and communicating 
through socket() mechanisms in an ad hoc protocol. The invention builds an 
additional structure on top of this. 

In "telephony classic" there is a single very large program that looks at a 
database for all users and decides what they would want to do. This program 
is too large to modify quickly, and c an only be modified by the equipment 
manufacturer. Again, this approach is not flexible enough for rapid evolution 
of new features. The architecture cf the invention makes it easier to 
understand and modify software, without the same complexity, allowing the 
system to be open to software deve lopment, so that new features may be 
brought to market very quickly. 
Graph 

It is preferred that the invention be applied to a network which employs a 
graph model as described in the co pending patent application under the 



titled "Method and System for 



Patent Cooperation Treaty, Serial No. 

Configuring Communications Systems". Briefly, the graph model constructs 
the signal processing and communi ;ations structure as a mathematical graph, 

"filters" that implement the nodes out of 
by a dynamic linking process or by setting 
the IP addresses to which they make socket connections, to have the 
interconnection structure specified by the edges. This graph is also used for 
communication among the agents, as the data structure that defines a 



which is later implemented by taking) 
libraries and modifying them, either 
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connection. An API layer that describe s characteristics of these graphs is 
added above the raw graph structure to assist in writing agents. 
In addition to filters, it is also preferred 
calls to proxy agents required to set up 
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that this graph data packet contain 
the call. Proxies may also send their 



agents to collaborate on building and managing graphs. 
An application programming interface (API) converts a series of comparatively 
simple and high level functions into the lower level instructions necessary to 
execute those functions, simplifying us a of an operating system. Using 
Windows APIs, for example, a program can open windows, files, and 
message boxes, as well as perform me re complicated tasks, by executing 
single instructions. 

The particulars of how an API for the invention is implemented are not critical, 
but it is desirable that a standard API be employed that expresses control, 



connection and negotiation processes, 
standard API simplifies the creation of 



including payment. The use of a 
lew features by third parties. 



A GUI is particularly well suited to the use of a graph model, as the GUI may 
present the assembled filters as defined by a graph data packet, to the user in 



a very logical and understandable form 



It is also preferred that the GUI have 



the functionality to let the user modify tpe graph data packet simply by altering 
filters and their interconnections. 
In the "pure Internet Protocol" approach the overall communications structure 
is not visible at all, while in the "telephony classic 0 approach it is possible for 
switch software to connect physical poits together, but not to pull functions out 
of libraries. The decision about what p Drts to connect together is explicitly 
made by the users by dialling telephony numbers in the "telephony classic" 
approach. 

Real Time Operating System (RTOS) 

Voice teleconferencing is a real time prDcedure, so RTOS should be used as 
known in the art. Generally, RTOS's divide code to be executed into smaller 
units of threads and functions, and then schedule the execution of these 
threads and functions to be performed prior to specific deadlines. 
Distributed Operating System 
A distributed operating system is one irk which portions of the software can run 
on different nodes. In the case of a telecommunications system, distribution 
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of software makes it easier to maintain real time operation as there are more 
options available to schedule timely execution. As well, distributed operation 
improves scalability and speed. Tho use of agents and proxies lends itself to 
the efficient use of a distributed system, in that agents and proxies may be 
assigned to run on different nodes of the system. Ideally, agents will be 
located close to where they are required, to minimize time delays in 
communicating with the entities they represent Such a suitable distributed 
RTOS is described in the co-pendin; j patent application under the Patent 
Cooperation Treaty, Serial No. I .titled "Distributed Telco". 



Figure 4 presents a block diagram of an exemplary operating system 
architecture in a preferred embodimsnt of the invention. A distributed 
communications substrate 80 is interposed between user processes and the 
underlying machines, so that processes can generally be moved from one 
machine to another without being av/are of it, either to distribute load or to 
recover from failures. 

Processes running in the system coine from different sources and accordingly 
get different treatment in terms of the trade-off between security and 
performance. Call-processing functions acting on behalf of the end users run 
in a protected "sandbox" environment on a virtual machine. Those working 
on behalf of the network provider may run there, but may also be 
implemented directly as processes running on the network operating system. 
User processes running as "filters", with the hard real-time demands that 
come from being in the signal path, and also run directly on the 
communications; substrate 80. Processes belonging to different users are 
protected from each other by the us jal operating system mechanisms such 
as memory mapping and file privileges, but the source is also reviewed by the 
network administrator. Filter processes on the same machine and part of the 
same call may share an address space and a thread of control, with data 
being passed with a function call mechanism and with connections to other 
hardware being handled by a stub that adapts a function call to a socket-type 
mechanism. These filters would still be dynamically linked, even with the 
function-call mechanism. 

Signals pass through filter processed F, which also implement drivers and 
performance-sensitive functions on behalf of the network. Call processing on 
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behalf of users is handled by CP pre cesses running in a secure virtual 
machine VM environment, which also includes checkpointing functions that 
can transfer control on failure to a "ghost machine" 82. All these processes 
run on the common software communications layer 80, which places them on 
appropriate physical systems and arranges for their connections. Server 
processes S also run on the commu iications substrate 80, but do not have " 
the hard real-time constraints of the filter processes. Secure call-processing 
functions are one type of server process. 
Mapper 

The allocation of tasks and network i capacity to different communications 
graphs is done by an optimizer called the mapper. There are in general 
many ways that a graph representing a desired communication can be 
allocated to a physical network: each of the filters can run on several different 
nodes, for example, and sometimes there are several types of links over 
which data can be carried. The simplest embodiment uses hints from the 
proxies about where to put radio link s, that is, after the voice coder, and then 
applies a "greedy algorithm 0 to put computing resources as close to the net- 
work edge as possible. A good mapper should be a distributed application in 
which local decisions are made with ocal knowledge to the greatest extent 
possible. It is not essential to get a global optimum, as long as resources are 
not seriously wasted. 

In the case of the invention, the mosi significant resource management 
problem is the handling of the voice streams. In having a separate mixer for 
each of N participants, each mixer will receive (N-1) voice streams. The 
mapper must balance the benefit of distributing the mixers among various 
processors against the extra cost of transporting redundant audio signals. 
The factor that governs these decision in the implementation of the invention 
is quality of service (QoS). Methods of distributing such real time loads are 
known in the art, but in the case of the invention, the solution will vary with 
each set of participants and network fopography. 
Negotiation 

it is preferred that the architecture fo* agents provide for use of the negotiation 
system described in the co-pending patent application under the Patent 
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Negotiating Telecommunication 
Many users are competing at once 
(including its computing capacity), 
resolve this contention: agents for 
payment as part of connection setub 
all parties have accepted it. A callejr 
reduced quality level if the cost 
high. For example, on Christmas 
users can expect to get through 
a busy signal. 

In "pure Internet Protocol", temporajry 
efforts" basis and packets may be 
longer time-scale by overprovisioniihg 
frequent. With differential service s 
defined, but the definition needs to 
invention can be used to manage 
access in such a way as to permit 
In "telephony classic" contention is 
first served) and again the network 
too frequent 

Negotiation management may be 
for each of the user terminals and 
networks. Each negotiation agents 
telecommunications system, for 
represents the interests of its 
over the telecommunications network, 
the negotiation and then passing a 
proposed connection, to each 
negotiation agent may either accept 
proposal to the other negotiation 
When all or part of the graph data 
assembles the listed filters in the 
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titled "Method and System for 



or the shared resources of the network 
t is preferred to apply a market model to 

parties involved offer and demand 
, and a connection does hot happen uritil 
can choose to try a connection at a 
high quality connection becomes too 
, the load may be temporarily high, so 
reduced voice quality rather than getting 



congestion is resolved on a "best 
4>most arbitrarily thrown out, and at a 
the network so that failures are not too 
small number of priority classes are 
je managed. The market model of the 
differential service, allocating high priority 
guarantees on service, 
managed by call admission (first-come, 
is overprovisioned so that failures are not 



implemented by having a negotiation agent 
each of the multiple telecommunications 
is operable to execute somewhere on the 
example, on the active network, and 

ive party in negotiating communication 
This is done by identify participants in 
graph data packet which describe the 
participant for their consideration. Each 
, reject or revise it to make a new 
adents. 



jacket is to be executed, a device simply 
manner defined in the graph data packet. 
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It is also preferred that the invention be 
mechanism that protects proxies from erroneous 
As well, it is presently desirable that proxies 
another language with similar advantages cou 
5 include: 

a. excellent security 

b. a large community of experienced 

c. object oriented code structure 

d. simple net-based distribution 
10 A telecommunications system implemented 

above provides a foundation for the mixed media 
greater flexibility and power to existing service^ 
and Internet gaming. 

Other options for implementation of the 

15 1. Companding 

Companding techniques use "compression 1 
(smoothly) so as to keep a signal's levejl 
algorithms that adjust gains to < 
techniques used in audio are frequency^ 

20 companding which adjusts filter cutoffs 

signal levels are low. 
An extreme example of expansion is 
level below a certain threshold are turned 
noise. In telephony the most common 

25 opposed to "cancellation", in which the 

its gain reduced, which reduces the 
oscillation. Companders use around 
Instantaneous companders work on a 
common A-law case is covered under" 

30 2. Voice coding 

Voice coders are used to reduce the 
signals. There are many types, but brojadly 
minimizing some mathematical measun 
source; or they can model what the ear 
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implemented- with a strong security 
or malicious code in other proxies, 
arid agents be written in Java™, but 
d also be used. Advantages of Java™ 



developers 

mechanism 

with the functionality described 
applications of the future, and for 
such as high bandwidth telephone, 

invention include: 

algorithms that try to adjust gains 
more constant and "expansion" 
te signal-level variations. Some 
dependent, such as Dolby 
to suppress background hiss when 



squelch" in which signals with power 
off completely to minimize idling 
variant is "echo suppression", as 
signal path from the quieter user has 
gain for echoing and feedback 
operations per sample, 
dample-by-sample basis, and the 
coders" below. 

bandwidth requirements for voice 

they can apt on the waveform, 
like error power; they can model the 
will notice. Coding for compression is 
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minirrum 



tO! 



i variant 



an active research area, and a 
appear. 

'Telephony classic" uses waveform 
law). Sampling is done at 8kHz on 
300Hz to 3300Hz. The passband 
on speech quality and intelligibility, 
33% margin over the Nyquist 
prefilter costs. A-law and p-Iaw are 
representations, chosen as a way 
a wide range of signal levels. By 
stereo 16-bit fixed-point sampled at 
the bandwidth and the use of a T1 
sample to sample, the same quality 
with ADPCM (adaptive differential 
speaking, digitizes the derivative 
Most digital cell-phones use a 
model the incoming sound in terms 
cords or airflow and which in turn d 
requires less bandwidth than 
more slowly than the waveform, bu 
speech or even for speech in a 
are an active topic of research and 
output rates anywhere from 4kb/s 
SOMIPs of DSP capacity. Coders 
and hence add at least that much 
Source coders typically try to detect 
silence typically saves about 50% 
side it is conventional to replace 
listeners know the connection is sti 
Source coding is difficult to use for 
model a large number of different 
early digital audio sgch as CD and 
enough bandwidth and dynamic 
ear. Minidisc and digital compact 



steady stream of new coders is likely to 



coding in the form of 8kHz A-law (or 
a signal filtered to pass the range from 

defined to get good subjective scores 
and the sample rate is designed with a 
in a trade-off between network and 
specialized 8-bit floating-point 
get roughly constant signal-to-noise over 
comparison, compact disc (CD) sound is 
44.1kHz, which requires roughly 24 times 
ine. Because speech varies slowly from 
can be had for roughly half the bandwidth 
riulse-code modulation) which, roughly 
instead. 
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of linear prediction coding, which tries to 
of a sound source that simulates the vocal 
ives a filter that models the larynx. This 

coding because the larynx moves 
works badly for anything other than 
environment. These "source coders" 
currently produce tolerable speech at 
. A typical modern coder uses about 
topically operate on 20msec frames of data, 
to the signal path, 
silence, and avoiding the transmission of 
bandwidth on average. At the decoding 
with "comfort noise" so that the 
Hive. / 

music, because it would be necessary to 
instruments alone and in combination, so 
DAT, just used waveform coding with 

to satisfy (more or less) the human 
dassettes brought in coding that reduced 



up 



celay 1 



cif 



silence 1 



range 
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CD bandwidth by a factor of about 10 b/ using psychoacoustics, 
Psychoacoustics applies, in particular, {nasking effects, where loud tones 
mask nearby ones for normal ears, and 
transmitting the inaudible components, 
rate-adapted, as in RealAudio, and is a 



speech applications in the system of tho invention 



Conventional filters, companders and s 



coded signal, so it is standard to decorr press before filtering. In some cases 
this may be avoided, for example, N-ws y combining can take advantage of 
silence to do companding at no additional cost of bandwidth, and only needs 
to decode and recode during bi-directional conversation. 
MPEG (Motion Picture Experts Group) 

MPEG coders do the same type of thin|j for video signals that perceptual 
coders do for music. Components of a 

frequencies are digitized at low resolution, using 8*8 discrete cosine 
transforms to do the filtering, and using "motion estimation" so that 
components of an image that can be derived from adjacent frames are not 
retransmitted. MPEG decoding is prefe rably left for the end-user's PC, 
because It is very demanding and because specialized hardware exists for it. 
However, the traffic properties are an inportant consideration in implementing 
the invention. Straight digitized television requires roughly 30 frames / sec * 



200 kpixels / frame * 3 colours * 8 bits / 
what 3G wireless is built to handle, but 
hence the 3G requirement for that rate. 



capacity when the image changes sudc enly. 
At the low-quality end, videoconferencing is usually done at 128kb/s. At this 
rate the coding process adds hundreds of msec of delay and the picture is 
poor. 

If there is high demand for full-motion video, then 5MHz slots will not have 
sufficient capacity, but 20MHz slots antjl generous use of antenna diversity 
could support 10-40 users at that rate. 
The network operating system could initiate processes in the end-user's PC 
so that video services can be set up easily. 
Other applications 



bandwidth can be saved by not 
This type of technique can also be 
good candidate for high-quality 



milar components will not work on a 



colour, for 144Mb/s. That is beyond 
MPEG2 gives similar quality at 2Mb/s; 
MPEG2 Is also bursty, needing more 
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Other applications such as animated 
locations, voice activation, automat 
signal shaping to compensate for the 
or software in the system, are all 
5 invention. 

Examples have been shown to 
but the number of variations is by no 
could be made for any telephony device, 
machines, pagers, point of sale computers], 
10 private branch exchanges. While particular 
have been shown and described, it is clear 
made to such embodiments without departing 
invention. 

The invention could also be 
1 5 Internet and PSTN networks. For exampl^, 
the functionality of the invention similar to 
PSTN a specialized server could be attached 
implementations would not have all the 
certain aspects of its teachings. 
20 The method steps of the invention 

machine code stored in a variety of formatjs 
Such code is described geneiically herein 
program for simplification. Clearly, the 
with the code of other programs, implemented 
25 calls or by other techniques as known in 
The embodiments of the invention 
or similar device programmed in the 
an electronic system which is provided 
Similarly, an electronic memory means su 
30 Access Memory (RAM), Read Only Memojry 
storage media known in the art, may be 
As well, electronic signals representing 
via a communication network. 



i the 



manner 



I witi 
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video, stereo input at the participant's 
c gain control (AGC) at the user's PC, and 
frequency response of certain devices 
Mown in the art, and easily applied to the 

demonstrate various aspects of the invention, " 
means complete. Comparable implementations 
including personal digital assistants, fax 
amateur radios, local area networks or 
embodiments of the present invention 
that changes and modifications may be 
from the true scope and spirit of the 

implemented to a lesser extent on existing 

, Internet servers could be given much of 
j applications such as NetMeeting. On the 
to a class 5 switch. These 
behefits of the invention, but could apply 

nay be embodied in sets of executable 
such as object code or source code, 
as programming code, or a computer 
executable machine code may be integrated 
as subroutines, by external program 
art. 

may be executed by a computer processor 
of method steps, or may be executed by 
means for executing these steps, 
h computer diskettes, CD-Roms, Random 
(ROM) or similar computer software 
programmed to execute such method steps, 
these method steps may also be transmitted 
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It would also be clear to one skilled in the art that this invention need not be 
limited to the described scope of computers and computer systems. The principles of 
the invention could be applied to citizen's band radio, amateur radio, or packet radio. 
Again, such implementations would be clear tcf> one skilled in the art, and do not take 
away from the invention. 
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WHAT IS CLAIMED IS: 

1 . A system for teleconferencing comprising: 

three or more user terminals, each having an audio input and an audio output; 
a telecommunications network interconnecting said user terminals and operable to 

transport data to and from said use * terminals; 
separate modular mixing software for each respective user terminal, executing on - 

said telecommunications network, t nd operable: 

to receive separate audio signals from said audio outputs of the others of said 

user terminals; and 
to combine said separate audio signals into a signal for said audio input of 

said respective user terming I which correlates to the needs of said 

respective user terminal. 



aid 



each 



2. A system as claimed in claim 1, further 
modular connection management software 
said three or more user terminals 
including a connection proxy for 
said telecommunications network; 
each of said connection proxies executing 
to represent its owner's interests in 
recognizing the limitations o 



cind 

on said system and being operable: 
managing the teleconference by 
its resources. 



3. A system as claimed in claim 2, further 
a mapper for locating said separate modular 
terminal for execution on different 
network, trading off delay time in 
computational power available in orjder 



A system as claimed in claim 3, wherein 
comprises multiple telecommunications 
and protocols, each of said multiple 
own connection proxy. 



comprising: 
for establishing interconnections between 
said separate modular mixing software, 
of said three or more user terminals and 



comprising: 
mixing software for each respective user 
rbuters of said telecommunications 
communicating data between routers with 
to maintain quality of service. 



said telecommunications network 
networks with varied transport media 
telecommunications networks having its 
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5. A system as claimed in claim 4, further 
negotiation management software including a 

terminals and said multiple telecom munjications 
negotiation agents being operable: 
to execute on said system; and 
to representing the interests of each of 
negotiating communication over 
said negotiation management software being 

to identify negotiation agents participating 
to implement a negotiation discipline which 
negotiation agent to consider a 
accept or revise said communication 
to respond to said negotiation being 
communication contract. 



A system as claimed in claim 5, whereirji 
software is operable to combine said 
channels which define a metaphorical 
having a simulated position within said 
individual users may be recognized by 



A system as claimed in claim 6 t wherein 
from said respective user to emphasize 
the corresponding audio signal prior to 



combining. 



A system as claimed in claim 7, whereiip 
comprises a personal computer having 
speakers, and said respective mixer software 
separate signals into two audio channe 
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comprising: - 

Negotiation agent for each of said user 
networks, each of said 



jaid three or more user terminals in ~ 
said telecommunications network; 
oberable: 

in a negotiation; 
allows each said participating 
Communication contract and either 
contract; and 
successful by executing said 



said separate modular mixing 
separate signals into two or more audio 
physical space, each user terminal 
metaphorical physical space whereby 
heir particular position in said space. 



said mixer is responsive to a request 
a particular user's voice by amplifying 



at least one of said user terminals 
a stereo sound card and stereophonic 
is operable to combine said 
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A system as claimed in claim 8, wherein 
comprises a connection to a PSTN 
lines, and said respective mixer software 
separate signals into two audio channels 



10. 



A system as claimed in claim 9, whprein each said connection proxy 
comprises: 
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at least one~of said user terminals 
network via two monophonic telephone 
is operable to combine said 



multiple software agents each being opera Die to perform a specific task; and 
a proxy object operable to instantiate particular ones of said multiple software agents 

in response to requirements of communications made over said 

telecommunications system. 



11. A server for teleconferencing comprising 
means for interconnecting user terminals 
terminals; 

means for executing separate modular mining 
terminal, said separate modular mixing 
means for receiving separate audic 
others of said user terminals 
means for combining said separate 
input of said respective use 
said respective user termini! 



qnd transporting data to and from said user 

software for each respective user 
software including: 
signals from said audio outputs of the 
; and 

audio signals into a signal for said audio 
terminal which correlates to the needs of 



12. A method of teleconferencing compnsing 
receiving, at a separate modular mixer representing 
more user terminals and executing 
separate audio signals from audio 
and 

combining said separate audio signals ink 
respective user terminal which 
terminal. 



the steps of: 

a respective one of three or 
on a telecommunications network, 
outputs of the others of said user terminals; 

a signal for an audio input of said 

to the needs of said respective user 



con elates 
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13. A computer data signal embodied in a 
comprising a 

to perform the steps of claim 12. 



(parrier wave, said computer data signal 

a computer 



set of machine executable code being executable by 



PCT/CA99/00875 



14. 



A computer readable storage medium sitoring a set of machine executable 
code, said set of machine executable CDde being executable by a computer 
server to perform the steps of claim 12, 
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FIGURE 3 
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